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about  Visit  to  Wright  Laboratory  and  Fall  MRS  Meeting  in  Boston 

On  27th  November-lst  December  I  had  met  with  reseachers  of  Wright 
Laboratory  (OH).  The  aims  of  my  visit  were  a  discussions  of  problems 
connected  with  implementation  of  SPC-94-4097,  partaking  in  Fall  MRS 
Meeting  with  report  and  in  discussions  on  prediction  of  inorganic 
compounds. 

During  my  visit  I  presented  my  report  "Principles  of  the  computer 
design  of  new  inorganic  compoimds"  (Appendix).  I  discussed  with  rese¬ 
arches  from  Wright  Lab.  the  principles  of  the  work  of  programms  of 
the  concept  formations  and  problems  of  their  apply  to  predictions  of 
new  inorganic  compounds  with  predefined  properties  and  choice  of  the 
best  technology  for  synthesis  them.  We  selected  the  classes  of  compo¬ 
unds  for  prediction  according  to  SPC-94-4097. 

The  programme  of  the  collaboration  in  the  area  of  computer  design 
of  the  new  inorganic  compounds  and  development  of  software  for  US  AF 
were  discussed. 


Dr. N .  N. Kiselyova 
A.A.Baikov  Institute  of  Metallurgy 
of  Russian  Academy  of  Sciences 


APPENDIX 

PRINCIPLES  OF  THE  COMPUTER  DESIGN  OF  NEW  INORGANIC  COMPOUNDS 

N.N.  KISELYOVA 
A.A.Baikov  Institute  of  Metallurgy 
of  Russian  Academy  of  Sciences 

117911,  GSP-1,  Moscow,  Leninskii  Prospect,  49,  Russia 
Fax:  (095)-135-8680;  E-mail:  kis@lesr.imet.ac.ru 

The  problem  of  the  computer-aided  design  of  new  materials  has  two 
aspects  [1-3].  The  first  aspect  is  the  prediction  of  compounds,  not 
yet  synthesized,  which  would  have  predefined  properties,  using  infor¬ 
mation  of  the  databases,  and  the  choice  of  the  best  technology  for 
synthesis  them.  The  second  aspect  is  the  search  for  optimum  conditi¬ 
ons  for  the  production  and  for  the  treatment  of  the  new  materials 
with  the  aim  of  optimization  of  the  desired  properties.  In  most  cases 
these  two  problems  can  be  successfully  solved  by  applying  the  cyber¬ 
netic-statistical  approach  that  we  suggested  [4].  The  principal  idea 
of  this  approach  is  the  combination  of  the  databases  on  materials  and 
the  cybernetic  prediction  methods  used  to  solve  the  first  problem  and 
the  statistical  methods  of  design  of  the  required  experiments  (or 
other  methods  of  optimizing)  that  provide  the  solution  for  the  second 
problem. 

A.A.Baikov  Institute  of  Metallurgy  is  now  m^ing  an  information- 
-predicting  computer  system  for  the  practical  realization  of  this  ap¬ 
proach  [1-3].  This  system  is  intended  for  the  data  retrieval  on  the 
known  compounds,  the  prediction  of  inorganic  compounds,  not  yet  syn¬ 
thesized,  and  the  forecasting  of  their  properties.  This  system  is  ba¬ 
sed  on  cybernetic  predicting  subsystem.  Apart  from  the  latter  this 
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jQ^^K^fcerties,  database  on  elements  properties,  knowledge  base, 

^^^^Ponal  processor  and  monitor  (see  Figure  1).  The  system  is 
for  the  IBM  PC  computer. 

?_Predicting  subsystem. 

To  discuss,  in  brief,  the  predicting  subsystem,  it  must  be  pointed 
out  that  a  cybernetic  approach,  developed  by  us,  is  based  on  machine 
learning  strategies. 

The  chemical  foundations  of  the  cybernetic  prediction  of  inorganic 
compounds  are  based  on  Mendelyev‘s  law  which  asserts  that  the  perio¬ 
dic  nature  of  changes  in  the  properties  of  chemical  systems  depends 
on  the  nature  and  properties  of  the  elements  which  m^e  these  systems. 

The  problem  of  predicting  the  new  compounds  was  reduced  to  the  analy¬ 
sis  of  the  multidimensional  array  of  property  values  and  the  column 
vector  of  the  desired  property.  Each  row  corresponds  to  some  known 
physico-chemical  system,  whose  class  is  indicated  by  row  position  of 
column  vector.  The  process  of  analysing  all  this  information,  aimed 
at  finding  the  classifying  regularities,  with  the  help  of  special 
programs  is  referred  to  as  machine  learning.  By  substituting  the  va¬ 
lues  of  the  properties  of  the  components  of  the  unknown  system  in  the 
regularity  thus  found,  it  is  possible  to  determine  the  class.  The  im¬ 
plementation  of  this  stage,  called  the  prediction,  requires  only  the 
knowledge  of  the  values  of  the  component  properties. 

After  testing  many  programs  intended  for  machine  learning  applica¬ 
tions  we  fixed  on  the  class  of  algorithms  in  which  all  classifying 
regularities  could  be  presented  in  the  form  of  a  boolean  expression. 

The  implementation  of  these  machine  learning  logical  algorithms  di¬ 
rectly  involves  decreasing  the  number  of  searches  in  the  process  of 
formation  of  the  classifying  regularities.  The  algorithm  of  the  in¬ 
ductive  concept  formation,  developed  by  Prof.Gladun  and  Dr.Vaschenko 
[5-8]  which  we  have  used  for  nearly  twenty  years,  solves  this  parti¬ 
cular  problem  by  structuring  the  computer  memory  in  the  form  of  se¬ 
mantic  growing  pyramidal  networks.  The  construction  of  such  a  network 
during  the  input  of  the  objects  is  followed  by  learning  it,  i.e.  by 
singling-out  the  control  vertices  determining  the  class  to  which  the 
objects  belong.  The  resultant  classifying  regularities  c^  be  stored 
in  computer  memory  in  the  form  of  a  Teamed  semantic  network  or  an 
equivalent  Boolean  expression  in  which  the  meanings  of  the  component 
properties  make  the  variables.  During  the  prediction  process  the  com¬ 
puter  receives  only  the  atomic  numbers  of  the  elements  or  designati¬ 
ons  of  simple  compounds,  while  the  values  of  the  properties  of  the 
appropriate  elements  or  simple  compounds  are  automatically  extracted 
from  the  database.  After  their  sampling  by  means  of  a  special  program 
they  are  substituted  in  the  classifying  regularity  and  the  researcher 
can  easily  obtain  the  necessary  prediction. 

An  important  problem  of  any  computer  classification  is  the  choice 
of  the  properties  for  the  description  of  physico-chemical  systems. 

This  procedure  can  hardly  be  formalized  completely,  but  the  machine 
learning  algorithm  we  use  [5-8]  automatically  rejects  those  properti¬ 
es  which  have  no  importance  for  the  classification  process.  The  ini¬ 
tial  set  of  the  properties  for  the  computer-aided  analysis  is  prepa¬ 
red  by  the  materials  scientist  and  it  is  desirable  that  the  system  of 
artificial  intelligence  extrapolates  information  from  this  represen¬ 
tative  set  of  characteristics. 

What  properties  of  the  components  do  we  use  for  the  description 
[1-3,  9-22]  ?  In  the  first  place,  the  fundamental  properties  of  che¬ 
mical  elements:  distribution  of  electrons  over  the  energy  shells  of 
isolated  atoms,  ionization  potentials,  ionic,  covalent  or  atomic  ra¬ 
dii,  melting  points,  standard  entropies  of  individual  substances  and 
the  like.  In  the  second  place,  the  properties  of  simple  compounds  - 
oxides,  chalcogenides,  halides,  etc.  -  are  included  as  required  by 
the  composition  of  the  compounds  to  be  predicted. 

Using  these  concepts  we  have  successfully  solved  the  following 
problems  [1-3,  9-22]: 

-  prediction  of  compound  formation  or  non-formation  for  ternary 
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h  of  the  formation  of  the  ternary  and  more  complicated 
s  of  desired  composition; 

Tction  of  phases  with  definite  crystal  structure  types; 

-  estimation  of  phase  quantitative  properties  (critical  temperature 
of  transition  to  superconducting  state,  homogeneity  region  etc.). 

By  applying  the  cybernetic  approach  we  have  predicted  the  formation 
of  thousands  of  new  compounds  in  ternary,  quaternary  [1-3,  9-22]  and 
more  complicated  systems.  These  compounds  were  then  searched  for  new 
semiconductors,  superconductors,  ferroelectrics,  magnets  and  other 
materials  required  for  new  technologies.  The  comparison  of  these  pre¬ 
dictions  with  the  experimental  data,  obtained  later,  showed  (see  Tab¬ 
le)  that  the  average  reliability  of  predicted  phases  equals  about  85 
percent.  Such  a  high  accuracy  for  a  priori  predictions  is  not  ensured 
by  any  other  known  theoretical  method. 

The  best  criterion  of  correctness  for  any  theoretical  method  is  the 
experimental  validation  of  the  prediction.  The  experimental  validati¬ 
on  of  our  theoretical  results  has  shown  that  this  method  of  cyberne¬ 
tic  prediction  is  an  useful  tool  for  the  materials  scientists  and 
chemists  concerned  with  the  a  priori  prediction  of  inorganic  substan¬ 
ces  which  would  have  predefined  properties. 


We  return  to  the  information-predicting  system  [1-3]. 

?_The  database  of  ternary?.  ?Jnorganic  compound  properties?,  is  developed 
by  IMET  during  more  then  10  years  [23].  At  present  this  database  con¬ 
tains  information  on  about  36  000  ternary  compounds  taken  from  more 
then  9  000  publications.  The  database  contains  the  following  informa¬ 
tion  about  the  ternary  systems  and  compounds  (Figure  2).  Some  of  data 
have  been  assessed  by  experts.  This  database  is  a  part  of  integrated 
DB  of  the  inorganic  compound  properties  being  developed  by  IMET. 

?_The  knowledge  base?,  will  store  the  multidimensional  regularities  al¬ 
ready  obtained  for  various  classes  of  inorganic  compounds  for  their 
use  in  the  prediction  of  phases  and  forecasting  of  the  phase  proper¬ 
ties.  The  knowledge  will  be  represented  in  the  form  of  semantic  net¬ 
works  or  of  their  equivalent  conjunctive-disjunctive  logical  expres¬ 
sions. 

?  Conversational  processor?,  will  manage  the  conversation  of  the  user 
wiHi  the  information-predicting  system,  as  well  as  giving  expert  fun¬ 
ction  support  in  the  given  application  domain. 

?  JThe  monitor?,  will  control  the  computation  process  and  provide  the 
interface  between  the  functional  subsystems.  In  addition,  the  monitor 
signals  if  new  experimental  data  contradict  the  available  regulari¬ 
ties.  Such  contradiction  will  be  eliminated  by  including  new  data  in 
the  "learning  step"  and  modifying  the  regularity  in  the  knowledge  ba¬ 
se. 

The  information-predicting  system  is  a  modified  version  of  expert 
systems.  It  differs  from  conventional  expert  systems  [24-26]  in  that 
it  employs  an  unusual  procedure  of  receiving  the  knowledge  from  the 
experts  and  its  subsystem  of  explanations  is  of  rudimentary  nature. 

From  our  experience  we  find  that  any  attempt  to  make  a  chemist  offer 
an  adequate  explanation  of  the  causes  of  the  formation  of  one  phase 
or  another,  or  the  nature  of  some  properties  of  a  compound,  usually 
ends  in  failure.  Instead,  the  chemist  prefers  to  estimate  the  auten- 
ticity  of  data  concerning  the  existence  of  the  given  compound  or  the 
values  of  the  particular  phase  properties.  Therefore  we  abandoned  the 
idea  of  making  the  chemist  outline  the  rules  of  the  formation  of 
inorganic  substances  with  desired  properties  and  decided  to  make  use 
only  of  the  chemist’s  expert  estimation  of  the  data  intended  for  nia- 
chine  learning.  For  this  reason  the  explanations  subsystem  of  the  in¬ 
formation-predicting  system  is  response  to  the  user‘s  inquiry  about 
the  nature  of  one  prediction  or  another  can  only  specify  the  range  of 
the  changes  in  the  properties  of  the  components  that  determine  the 
phase  parent^  class  and  indicate  the  analogs  of  the  predicted  compo¬ 
und  among  the  learning  examples. 
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3r  optimum  conditions  nwi  sth 

Jie  new  materials. 

^  the  prediction  of  new  compound  with  desired  property  it 

_ to  begin  the  synthesis  of  this  compound.  In  this  case,  the 

problem  can  be  formulated  thus:  to  search  the  optimal  technological 
conditions  which  ensure  an  extremum  of  the  desired  property.  Some  of 
the  mathematical  optimizing  method  can  be  used  for  the  solution  of 
this  problem.  For  example  we  succeeded  in  finding  the  optimum  condi¬ 
tions  for  the  synthesis  of  Chevrel  phase  with  composition  Ag  MoS 
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and  maximum  of  the  critical  temperature  of  transition  to  superconduc¬ 
ting  state  predicted  by  us.  In  this  case  we  have  used  the  statistical 
methods  of  the  design  of  the  the  multifactor  experiments.  Now  my 
young  colleagues  N.Kravchenko  and  V. Petukhov  develop  the  program  sys¬ 
tem  for  the  search  for  the  optimum  sections  of  the  acousto-,  electro- 
and  nonlinear-optical  single  crystals  using  the  method  by  Lagr^ge 
and  information  of  our  DB  on  properties  of  these  substances  (Figure 
3)  [27,28]. 


In  that  way  we  have  considered  a  sequential  procedure  of  the  design 
of  new  compounds  with  the  predefined  properties,  namely,  a  database 
—  >  a  cybernetic  predicting  system  —  >  an  optimization  of  the  tech¬ 
nology  of  synthesis  or  treatment  of  predicted  compounds.  The  predic¬ 
tion  of  phases,  the  forecasting  of  the  desired  property,  using  the 
information  from  the  database,  and  the  prediction  of  the  best  type  of 
technology  for  synthesis  or  treatment  can  be  considered  as  a  strategy 
of  the  search  of  substances  with  the  predefined  properties.  The  int¬ 
rinsic  properties  of  the  chemical  elements  and  of  the  simple  comim- 
unds  are  used  for  description  of  multicomponent  physical-chemical 
systems  on  this  stage.  The  known  experimental  data  about  substances 
and  about  the  types  of  technology  for  their  s)aithesis  are  analyzed 
using  computer.  The  design  of  experiments  or  other  methods  of  optimi¬ 
zing  for  the  search  for  optimal  conditions  of  synthesis  or  treatment 
of  certain  substances,  predicted  on  the  first  stage,  can  be  conside¬ 
red  as  the  tactics  of  the  search  for  the  new  materials.  The  resear¬ 
cher  operates  with  the  technological  parameters  on  this  stage. 

We  considered  an  automation  of  the  search  of  the  substances  on  the 
base  of  the  new  information  technologies.  The  researcher  c^  be  deli¬ 
vered  from  the  routine  work  and  becomes  a  creator  using  his  experien¬ 
ce  and  his  knowledge  as  much  as  possible.  Besides  intellectualization 
of  the  scientific  work,  this  approach  allows  to  promote  the  search 
for  new  substances  with  predefined  properties. 
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